Skip to main content
TrustRadius
Amazon Redshift

Amazon Redshift

Overview

What is Amazon Redshift?

Amazon Redshift is a hosted data warehouse solution, from Amazon Web Services.

Read more
Recent Reviews

Redshift trumped Hive

9 out of 10
January 15, 2021
Incentivized
It is used within a few departments. It is used to solve certain legacy problems that have not yet been ported over to other more suitable …
Continue reading
Read all reviews

Awards

Products that are considered exceptional by their customers based on a variety of criteria win TrustRadius awards. Learn more about the types of TrustRadius awards to make the best purchase decision. More about TrustRadius Awards

Return to navigation

Pricing

View all pricing

Redshift Managed Storage

$0.24

Cloud
per GB per month

Current Generation

$0.25 - $13.04

Cloud
per hour

Previous Generation

$0.25 - $4.08

Cloud
per hour

Entry-level set up fee?

  • No setup fee

Offerings

  • Free Trial
  • Free/Freemium Version
  • Premium Consulting/Integration Services
Return to navigation

Product Demos

ETL From Amazon RDS to Amazon Redshift with using AWS Glue Service

YouTube

Introduction to Query Scheduler for Amazon Redshift

YouTube

ETL From AWS S3 to Amazon Redshift with AWS Lambda dynamically.

YouTube

Amazon Redshift Tutorial | AWS Tutorial for Beginners | AWS Certification Training | Edureka

YouTube
Return to navigation

Product Details

What is Amazon Redshift?

Amazon Redshift Technical Details

Deployment TypesSoftware as a Service (SaaS), Cloud, or Web-Based
Operating SystemsUnspecified
Mobile ApplicationNo

Frequently Asked Questions

Amazon Redshift is a hosted data warehouse solution, from Amazon Web Services.

Reviewers rate Usability highest, with a score of 10.

The most common users of Amazon Redshift are from Mid-sized Companies (51-1,000 employees).
Return to navigation

Comparisons

View all alternatives
Return to navigation

Reviews and Ratings

(206)

Attribute Ratings

Reviews

(1-19 of 19)
Companies can't remove reviews or game the system. Here's why
Score 10 out of 10
Vetted Review
Verified User
Incentivized
Amazon Redshift is our Data Warehouse, where we store our processed data (Hot data) for various initiatives like BI, Analytics, DataScience, etc

We also use Amazon Redshift Spectrum as our Data Lake, where we store raw (un-processed) data (Cold data) for historical analysis, trends, etc

We store various standard data in Redshift like:
Bronze (ETL-ed data),
Silver (Materialized Views data), and
Gold (Rollups/Aggregated/Dashboard-ready data) in [Amazon] Redshift





  • [Amazon] Redshift has Distribution Keys. If you correctly define them on your tables, it improves Query performance. For instance, we can define Mapping/Meta-data tables with Distribution-All Key, so that it gets replicated across all the nodes, for fast joins and fast query results.
  • [Amazon] Redshift has Sort Keys. If you correctly define them on your tables along with above Distribution Keys, it further improves your Query performance. It also has Composite Sort Keys and Interleaved Sort Keys, to support various use cases
  • [Amazon] Redshift is forked out of PostgreSQL DB, and then AWS added "MPP" (Massively Parallel Processing) and "Column Oriented" concepts to it, to make it a powerful data store.
  • [Amazon] Redshift has "Analyze" operation that could be performed on tables, which will update the stats of the table in leader node. This is sort of a ledger about which data is stored in which node and which partition with in a node. Up to date stats improves Query performance.
  • Amazon Redshift is a Managed Service. But it is Not a 100% managed service. We still need to configure it with WLM (Work Load Management) settings, and add Query Queues to make sure it's resources aren't wasted and it is performant at it's best state, all the time
  • [Amazon] Redshift has a concept of "Vacuum", which is an operation to claim the disk space back from deleted data/tables. They recently started doing automated vacuuming. Prior to that we had to do that at regular intervals, to claim the data back.
[Amazon] Redshift is suited for various use cases like Time series data, Structured / relational data, Semi structured data like JSON, etc.

[Amazon] Redshift might not work 100% well with full performance, for Graph DB use cases.
Score 10 out of 10
Vetted Review
Verified User
Incentivized
We have decided to purchase Amazon Redshift since we started the project of building a new "data lake," so the first step was to decide which tool would be more appropriate to use as a data warehouse. Since we have everything on the cloud, we choose Amazon Redshift to connect our current tools on AWS and integrate the data.
  • Data integration is very simple to perform
  • The tool provides some advice that is very useful
  • Their support is always complete and easygoing
  • Their documentation could be even better
If you are looking for something easy to implement that will give you a nice performance, I would suggest Amazon Redshift. I'm using it in AWS environment, so I don't know if in another cloud environment the performance and all the features would be nice as well. It's also important to check if the price fits to you too.
Score 9 out of 10
Vetted Review
Verified User
Incentivized
We use Amazon Redshift for our insights platform in our R&D space. Our team creates reports and dashboards on tools for business use. Amazon Redshift provides greater supply chain visibility, increased information on product movement, and high efficiency at a much faster rate.
  • Robust as compared to traditional database/data warehouse
  • Offers significant query speeds
  • Low cost of ownership
  • Provides MPP only for AWS-supported storages
  • Prerequisites for configuring tables are not easy
  • Not great for use with web apps
Amazon Redshift performs extremely well for reporting/analytics data and is way ahead of other competitors. The biggest challenge is migrating data from on-premises databases to Amazon Redshift. The initial hurdle is a major one.
Bojan Sovilj | TrustRadius Reviewer
Score 10 out of 10
Vetted Review
ResellerIncentivized
Cloudwalker offers analytic services for the gambling industry. The gambling industry has vast amounts of data that are high speed and variability. Our services from Redshift help gambling companies have better control of their bookmaking product, have a complete view of customers betting history, helps with detecting problematic accounts, etc.
  • Redshift has concurrency scaling helps serve more customers queries
  • Redshift has automatic table compression having less disc space consumed comparing to other data warehouse solutions
  • With ra3 new node types we can separate storage and compute
  • Having automatic vacuum delete helps having conzisent performance in cases where data variability in dwh production zone is present
  • Consistent service improvements from AWS: temporary tables, null handling in joins, single row inserts, materialized views
  • Frequent changes of management console look and feel
  • Automatic vacuum sort doesn't work for several billion rows tables
  • Disc IOPS performance monitoring excluded
Redshift is great data warehouse solution if you have several billion rows tables. More than 200 very important improvements were added in several years' time. With new Redshift instance types solution has separation of storage and compute and magnitude better query response times. Don't use Redshift if you have less than several billion rows tables.
Jay Padhya | TrustRadius Reviewer
Score 9 out of 10
Vetted Review
Verified User
Incentivized
We are performing a POC (proof of concept) across multiple cloud vendors, and we have evaluated GCP, Azure, and Amazon. We plan to go on-site cloud from on-premises, and we will evaluate all databases across all cloud providers. We are making the current database over Amazon Redshift, if it can help us do the exact same job as we have on-premises SQL Server. The on-prem option is good for replication and security, and we are evaluating how good it can be on ML application support.
  • Robust
  • Great UI
  • Price
  • Implementation in non AWS server
Except for the price, Amazon Redshift is a great tool and has the fastest performance across all the data warehouses we have seen. It's easy to connect with Talend, which makes it a better option to use. I like the UI better than most of the other DW. Overall it's a great DW tool.
November 30, 2019

Redshift Review

Score 2 out of 10
Vetted Review
Verified User
Incentivized
Amazon Redshift was our enterprise data warehouse as a backend to our BI solutions.
  • Fixed cost.
  • Tunable table design.
  • Need to provision warehouse for highest capacity.
  • No real separation between computing and storage (even when considering Spectrum).
  • All users share the same infrastructure resulting in frequent 100% utilization error messages.
  • A leader node can become a bottleneck for too many concurrent aggregate queries.
Redshift is appropriate when the number of concurrent users are low and pointed queries are the focus. It is not appropriate when a large number of concurrent users is to be supported,
September 27, 2019

Redshift is way too easy!

Score 9 out of 10
Vetted Review
Verified User
Incentivized
Redshift is our data warehouse used by our organization. It takes data from different sources and put them together in Redshift for our Analytics team to diagnose.
  • Since it's part of AWS it is fairly quick and easy to set up.
  • You can add nodes fairly quick to expand the data needs.
  • Performance from the analytics reports accessing Redshift is really good.
  • Better database management when looking up table metadata or sizes of tables.
  • Need a better query analyzer.
  • Finding errors during a data load can be a little daunting at times.
It's very cost-effective from other databases we were using for our data warehouse. It was really easy to set up and it used our ETL tools to migrate data from different data sources. We added functionality add aggregate the data set for our Analytics team to analyze.
Akshaya Bhardwaj | TrustRadius Reviewer
Score 8 out of 10
Vetted Review
Verified User
Incentivized
Amazon RedShift is used in all departments and accounts. This tool is newly integrated into the system so as to work with the data on the cloud. There are various projects which are moving from SQL servers to ARS because of its capacity of working and managing the data in the cloud.
  • We can connect with multiple servers and can fetch the data easily from one server to other.
  • It supports the syntax of the bots of the SQL servers, MS SQL and Oracle SQl. This makes it pretty handy to use.
  • Here we use views instead of tables, so we can clearly see the flow of data.
  • It works very slow in the cloud environment.
  • No statistical inbuilt functions are available within the tool.
  • Its user interface is not very attractive.
  • Often it goes into deadlock state, which kills the running jobs.
Amazon Redshift is well suited for fetching data from multiple different sources and servers, and it is very easy to learn how to use this tool. It is less appropriate for situations where you may need to process only limited rows, as this takes a large amount of time since you cannot create multiple tables in it.
Score 7 out of 10
Vetted Review
Verified User
Incentivized
AWS Redshift is the cloud-based data warehouse where we store our application level datasets and is used further for insights from the stored dataset. It improves the decision support for our business based on data analytics on a large set of real-time datasets which force the business processes to the next level. It provides good performance with high availability for essential data analysis and valuable intelligence.
  • Easy query and fast execution
  • High performance and availability
  • Support of large datasets
  • Scalable solution
  • Database optimization
  • Time consuming process for schema design and modification
  • Integration is little bit difficult
Amazon Redshift is the data warehouse under the umbrella of AWS services, so if your application is functioning under the AWS, Redshift is the best solution for this. For large amounts of data, the application is the best fit for real-time insight from the data and added decision capability for growing businesses. If your application is outside of AWS it might add more time in data management.
Score 10 out of 10
Vetted Review
Verified User
Incentivized
Amazon Redshift is being used as our primary analytic data-warehouse. This allows our data and analytic team to build report and query data without going directly to our production database. It is a central data repository from external data sources as well, data we import from 3rd parties and segment.
  • Complex queries
  • Aggregation
  • Fully managed service
  • Works very well with most BI/reporting solutions
  • Stored procedures
  • Job scheduling
  • A easier way (perhaps a GUI) to manage users permission
Amazon Redshift is great for analytics, reporting, and complex queries for statistical modeling and machine learning. Its ability to run parallel queries in a simple SQL environment makes the transition from traditional DB very easy. Very good for loading/reading/writing large datasets. I would not recommend RedShift for an environment that requires single row reads/updates, which it is not optimized for.
Jacob Biguvu | TrustRadius Reviewer
Score 8 out of 10
Vetted Review
Verified User
Incentivized
Capital One has many LOBs (line of businesses). I have supported IAM and Commercial LOB. They are using Redshift as a data warehouse solution. Oracle is not a Data Warehouse solution but was being used when the application was on on-premises. When they wanted to migrate all data to the Cloud, they chose Redshift as a solution to move the data from Oracle. Oracle is not a data warehouse solution. Redshift has been found as a good solution because of its unique features such as its MPP architecture, columnar architecture, and storage capacity.
  • User-experience. The user wants something quick to view the output, rather than spending too much time
  • preparing a code prior to seeing the output. Redshift provides SQL type queries. This makes any user happy and comfortable.
  • Architecture is very straightforward and simple to understand, such as MPP architecture, Encryption, and Columnar database design. We can easily address issues and help others to understand.
  • Scalability. We can scale-up and scale-down based on our workloads.
  • Performance tuning and database optimization can be done using the system tables and advisors. These solutions are similar to the solution available for Oracle SQL Server. It makes it easy to do the optimization for queries and databases.
  • The concurrency and scale up based on it could be improved. It would be good if it scale-up and scale-down the memory/CPU capacity automatically based on workload.
  • Often we experience slow on queries and dashboards. Self-tuning option in WLM does help.
  • Optimizing the areas such as Vacuum and reorganize the column data (sorting over time) automatically.
Amazon Redshift is good for when you need a Data Warehouse solution or a user-experience such as SQL kind queries. It is also good if you have limited budget constraints. It's not suggested when someone has DML queries such as INSERT, UPDATE, DELETE.
Score 9 out of 10
Vetted Review
Verified User
Incentivized
We use it as our data warehouse for reporting purposes.

My company used it solely for reducing the performance overhead of running long SQL queries. The seamless implementation of Redshift allowed us to get the data ready to go for our customers to run the reports they need. It is currently used by a few customers, but we are trying to get each of our customers to use this rather than using the traditional OLTP database.
  • Easy to work with
  • Seamless implementation with matillion
  • Massive data reads and inserts
  • I didn't like the security aspect of this where it asks us to create views for each customer.
  • It does not support row-level controls.
  • Some SQL queries are faster on native SQL than here. But it could be the data conversions that is causing it.
[It's] very good for reports and dashboards and is easy to use and fast performance makes it a good choice. It excels in columnar architecture and aggregates. If you have many clients as we do, you need to have separate schemas for each of them. That is a good way but also there is too much clutter spread all over. That is the only drawback I see, end users can see data in the schema, there are no individual permissions allotted.
Score 8 out of 10
Vetted Review
Verified User
Incentivized
Amazon Redshift is being used as the main data warehouse across the whole organization. It provides essential business performance data and valuable intelligence. Different departments utilize it for a variety of operations, and decision makers rely on the data analytic tools built upon its data for tracking and enhancement, wherever needed.
  • It does very well in data ingestion, and compresses data efficiently.
  • Most of the queries return results quickly even with large data sets.
  • It has a hard limit of total number of concurrent connections to the database. Compared with conventional databases that limit is very low.
  • Its workload management (WLM) mechanism could be improved, such as made more dynamic and easier to tune and manage.
It is best for large structured data sets with data normally loaded from files, instead of through heavy SQL data manipulation. If there are SQL data manipulation, better to have mainly DDL (create/drop/alter) instead of DML (insert/update/delete) to achieve the same goals. It is also better for smaller number of concurrent connections (less than 500).
Kyle Reichelt | TrustRadius Reviewer
Score 7 out of 10
Vetted Review
Verified User
Incentivized
Redshift is currently being used to house normalized client data pulled from various third-party endpoints. It houses the data that is both being accessed directly by our business intelligence and CRM platform, as well as made available via our own API gateways. It was chosen for its ability to support a "big data" environment with high availability.
  • If you need draw insights from immense amounts (see: petabytes) of transactional (repetitive) data in near real time--think machine learning and business intelligence--and you're already in the AWS ecosystem, then it's your only real option. It performs very well.
  • Highly configurable, intelligent compression of repetitive columns reduces your memory footprint, lending to extremely high performance.
  • As with most things in the AWS ecosystem, it scales seamlessly and endlessly.
  • There is no support for data de-duplication; meaning this has to be either accounted for upstream, or you'll have to build your own services to de-dupe your data.
  • It's strength is housing data, not necessarily data insertions. While it has an SQL-like interface, it shouldn't be approached the same as a typical relational database.
  • Permissions can be a pain... dovetailing on my previous "con" , in some instances it's easier to drop/rebuild a table than try to navigate incremental updates/insertions, but retaining user-permissions is a pain-point.
It is well suited for:
  • Petabytes of data requiring near real-time analysis
  • Massive Data Insertions
  • Massive Data Reads
It isn't well suited for:
  • Web apps
  • Smaller transactional inserts
  • Smaller reads
You wouldn't drive an 18-wheeler to the corner store to pick up a bag of chips. Your specific need will determine whether or not Redshift is suited for the job.
Gavin Hackeling | TrustRadius Reviewer
Score 8 out of 10
Vetted Review
Verified User
Incentivized
We use Amazon Redshift and Redshift Spectrum for our data warehouse. Our production transactional datastores are continuously replicated to Redshift and transformed into fact tables. Redshift is maintained by the data team, but it is used by analysts on most teams, including business intelligence, product, and customer support. Redshift is our source of truth; it provides information about business processes that the team needs to make decisions.
  • Redshift is fully managed. Small teams do not have the resources to maintain a cluster. CloudWatch metrics are provided out-of-the-box, and it is easy to configure alarms.
  • Redshift's console allows you to easily inspect and manage queries, and manage the performance of the cluster.
  • Redshift is ubiquitous; many products (e.g., ETL services) integrate with it out-of-the-box.
  • Writing .csvs to S3 and querying them through Redshift Spectrum is convenient.
  • We've experienced some problems with hanging queries on Redshift Spectrum/external tables. We've had to roll back to and old version of Redshift while we wait for AWS to provide a patch.
  • Redshift's dialect is most similar to that of PostgreSQL 8. It lacks many modern features and data types.
  • Constraints are not enforced. We must rely on other means to verify the integrity of transformed tables.
Redshift is ideal for small teams. It is fully managed. CloudWatch metrics are provided out-of-the-box, and it integrates well with other AWS products, such as DMS. The Redshift console is among the better AWS consoles. Redshift offers adequate performance. Spectrum offers a convenient way to access our data lake, but we have encountered issues with recent versions.
Score 10 out of 10
Vetted Review
Verified User
Incentivized
We used the Amazon Redshift for Analytics Data Warehousing. It helped to process our various departments in organization like renewals, sales, marketing & finance department to analyze the data very quickly and performance effective with tableau reporting tool.
  • It's a columnar data storage architecture and which allows it to particularly run structured data query for reporting very fast.
  • We used amazon redshift cloud datawarehouse with Tableau, looker reporting tool and it has perfectly helped our reporting needs for business users.
  • Very easy to copy data from Amazon Web Services S3 storage container to Redshift Database with simple copy statements.
  • It provides built-in commands to table structure effectively with less use of memory.
  • AWS can provide some cheaper options with pre core cpu purchase rather than hourly charges on amazon redshift.
  • There are no options for on-premise set-up of the amazon redshift database.
  • Limited documentation on best practices for dist key, sort key and various amazon redshift specific commands.
It's the best option when we need to have a high volume of structured data analytics datawarehouse design & development. It perfectly reports fast with tableau reporting tool, even data around 300 million records. It's best suited where the organization is planning to build a custom datawarehouse rather than using any pre-packaged BI Apps data model.
Vinaybabu Raghunandha Naidu | TrustRadius Reviewer
Score 7 out of 10
Vetted Review
Verified User
Incentivized
Redshift is one of the most scalable yet simple data warehousing tools. It is not only used within the engineering team but widely used by the business team too. One of the major use cases are to power the dashboards and reports which are in looker. It helped to migrate from the traditional email reports to dynamic dashboards and reports which can be shared to various end users and businesses. It helped in faster data retrieval and can scale well with ease.
  • Scalability with less downtime.
  • Performance optimization without affecting the business i.e zero downtime during optimization
  • Powerful yet simple to use. Very easy to optimize and improve performance during the regular read/write operations
  • No need of DBA to operate and maintain Redshift as it is a completely managed data warehousing tool
  • Good customer support and will respond in very quick time with clear information and documentation
  • Well documented commands
  • Can propose better optimization techniques based on the business use case.
  • Can provide option to set the upper bound on number of connections to cluster
  • Can improve on optimized writes/updates
Well suited for faster data retrieval and powering OLTP need and perfectly suited for generating dynamic dashboards and reports.
Not suited for massive data storage and analytics. Can-not handle unstructured data.
Michael Romm | TrustRadius Reviewer
Score 6 out of 10
Vetted Review
Verified User
Incentivized
Amazon Redshift is being used by many business units within our company. It is our new data warehousing platform.
  • Redshift seems to be as fast processing a large dataset as it is with a small one. It seems, when the dataset size is significantly increased (10x, 100x, 1000x, etc.), DML queries are often executed within the same amount of time.
  • Redshift has a powerful graphical admin tool to monitor the ongoing queries in real time and historically.
  • Easily expandable capacity. Automatic snapshots that eliminate the need for managing backups. Simple database maintenance strategies with the VACUUM and ANALYZE commands.
  • Abundance of detailed documentation and tutorials.
  • It could benefit from adding data integrity and programming tools common to other database management systems.
  • Amazon Redshift is based on PostgreSQL 8.0.2. That version of PostgreSQL was released in December 2006. While PostgreSQL was much improved since then, the new features were not implemented in Redshift. Many basic features are missing from it.
  • Primary keys can be declared but not enforced. Referential integrity (foreign keys) can be declared but not enforced. UNIQUE and CHECK constraints are not supported and cannot be declared.
  • IDENTITY can be declared on a column, and Redshift will put unique values into it. However: IDENTITY values in the newly inserted rows won’t be incremental or sequential. To implement a sequential number, you need to write your own custom code.
  • There are no stored procedures in Redshift. We are writing SQL script files, and then parsing and running them one statement at a time from a Python program. This also enabled us to implement execution-time error logging.
  • In SQL scripts, to check for the row count of affected rows, a complicated join query against some system tables or views has to be executed.
  • Data Control Language (DCL) does not exist. No statements like IF, WHILE, DO, RAISERROR, etc.
  • On performance of views… Views do not “pass-through” a query parameter which is a potential problem for performance.
  • When selecting against a view with the WHERE clause outside of the view, the inner query of the view will be executed first without consideration for the WHERE clause, and only then the WHERE clause will be applied.
  • Certain clauses of SQL work many times faster than other clauses. So be careful and test your statements for performance earlier rather than later, especially if working with a large data set.
  • There was a situation when DELETE FROM JOIN was unacceptably slow. Replacing JOIN with the USING clause made DELETE instantaneous.
Redshift is a viable platform to house a large or very large data warehouse designed for performance and scalability. It is especially well-suited in the cases where your source data is already stored inside of the AWS services infrastructure.
Score 7 out of 10
Vetted Review
Verified User
Incentivized
(JSON) events emitted from platform/web services are transformed and loaded into AWS Redshift in order to support analysis and reporting for our solution.
  • AWS infrastructure and support simplifies maintenance and administration
  • familiarity with PostgreSQL makes adopting Redshift as a column store easier
  • columnar data store allows for high performance queries on large volumes of data
  • there are some situations where having a column store more closely integrated as part of our platform would be better
  • AWS costs can add up
  • many other (open source) column stores have new and interesting features not (yet) available in Redshift
If you want an easy way to get started with a column store, spin one up on AWS and see if it fits your use case. AWS is a reasonably cheap way to adopt new technologies. Then after a while, you'll be in a better position to decide whether to commit more to AWS or choose from comparable technologies available.
Return to navigation